A Causal Inference Model Explains Perception of the McGurk Effect and Other Incongruent Audiovisual Speech
Authors
Abstract
Audiovisual speech integration combines information from auditory speech (talker's voice) and visual speech (talker's mouth movements) to improve perceptual accuracy. However, if the auditory and visual speech emanate from different talkers, integration decreases accuracy. Therefore, a key step in audiovisual speech perception is deciding whether auditory and visual speech have the same source, a process known as causal inference. A well-known illusion, the McGurk effect, consists of incongruent audiovisual syllables, such as auditory "ba" + visual "ga" (AbaVga), that are integrated to produce a fused percept ("da"). This illusion raises two fundamental questions: first, given the incongruence between the auditory and visual syllables in the McGurk stimulus, why are they integrated; and second, why does the McGurk effect not occur for other, very similar syllables (e.g., AgaVba)? We describe a simplified model of causal inference in multisensory speech perception (CIMS) that predicts the perception of arbitrary combinations of auditory and visual speech. We applied this model to behavioral data collected from 60 subjects perceiving both McGurk and non-McGurk incongruent speech stimuli. The CIMS model successfully predicted both the audiovisual integration observed for McGurk stimuli and the lack of integration observed for non-McGurk stimuli. An identical model without causal inference failed to accurately predict perception for either form of incongruent speech. The CIMS model uses causal inference to provide a computational framework for studying how the brain performs one of its most important tasks, integrating auditory and visual speech cues to allow us to communicate with others.
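The general idea of causal inference described in the abstract can be illustrated with a toy one-dimensional sketch. This is not the CIMS model itself, whose details are not given here. It assumes, purely for illustration, that each cue is a single number with Gaussian noise, that the prior over the source is flat, and that under separate sources the two sources differ by an extra Gaussian spread. All names and parameters (`causal_inference`, `var_sep`, `p_common`) are hypothetical.

```python
import math


def gaussian_pdf(x, mean, var):
    """Density of a normal distribution with the given mean and variance."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def causal_inference(x_a, x_v, var_a, var_v, var_sep, p_common):
    """Toy causal inference over one auditory and one visual cue.

    Returns (posterior probability of a common source, auditory estimate).
    Assumptions (illustrative only): Gaussian cue noise, a flat prior over
    the source, and an extra spread var_sep between separate sources.
    """
    disparity = x_a - x_v
    # Common source: the cues differ only because of sensory noise.
    like_common = gaussian_pdf(disparity, 0.0, var_a + var_v)
    # Separate sources: the sources themselves may differ as well.
    like_separate = gaussian_pdf(disparity, 0.0, var_a + var_v + var_sep)
    # Bayes' rule for the probability that both cues share one source.
    post_common = (like_common * p_common) / (
        like_common * p_common + like_separate * (1.0 - p_common)
    )
    # Reliability-weighted fusion, used when the cues share a source.
    w_a = (1.0 / var_a) / (1.0 / var_a + 1.0 / var_v)
    fused = w_a * x_a + (1.0 - w_a) * x_v
    # Simplification: with separate sources, fall back to the auditory cue.
    estimate = post_common * fused + (1.0 - post_common) * x_a
    return post_common, estimate


# Similar cues favor a common source; very different cues do not.
p_near, est_near = causal_inference(0.0, 1.0, 1.0, 1.0, 10.0, 0.5)
p_far, est_far = causal_inference(0.0, 10.0, 1.0, 1.0, 10.0, 0.5)
```

In this sketch, a small disparity pulls the auditory estimate toward the visual cue, while a large disparity leaves it essentially at the auditory cue. A model without the causal inference step would always return the fused value.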
Similar resources
Early and late beta-band power reflect audiovisual perception in the McGurk illusion.
The McGurk illusion is a prominent example of audiovisual speech perception and the influence that visual stimuli can have on auditory perception. In this illusion, a visual speech stimulus influences the perception of an incongruent auditory stimulus, resulting in a fused novel percept. In this high-density electroencephalography (EEG) study, we were interested in the neural signatures of the ...
Early and Late Beta Band Power reflect Audiovisual Perception in the McGurk Illusion
The McGurk illusion is a prominent example of audiovisual speech perception and the influence that visual stimuli can have on auditory perception. In this illusion, a visual speech stimulus influences the perception of an incongruent auditory stimulus, resulting in a fused novel percept. In this high-density electroencephalography (EEG) study, we were interested in the neural signatures of...
A neural basis for interindividual differences in the McGurk effect, a multisensory speech illusion
The McGurk effect is a compelling illusion in which humans perceive mismatched audiovisual speech as a completely different syllable. However, some normal individuals do not experience the illusion, reporting that the stimulus sounds the same with or without visual input. Converging evidence suggests that the left superior temporal sulcus (STS) is critical for audiovisual integration during spe...
Neural correlates of interindividual differences in children's audiovisual speech perception.
Children use information from both the auditory and visual modalities to aid in understanding speech. A dramatic illustration of this multisensory integration is the McGurk effect, an illusion in which an auditory syllable is perceived differently when it is paired with an incongruent mouth movement. However, there are significant interindividual differences in McGurk perception: some children ...
The effect of musical aptitude on the integration of audiovisual speech and non-speech signals in children
Multisensory integration was assessed using two audiovisual illusions. In the McGurk effect, auditory speech perception is altered by incongruent visual speech. In the Shams illusion, the number of seen flashes is altered by an incongruent number of heard beeps. The illusions were tested in 10-year-old children, whose musical aptitude was also assessed. The strength of the McGurk effect was not...